A Realizable Eecient Parallel Architecture
نویسنده
چکیده
The near future will present large scale parallel computers, able to provide computing power of more than one TFlop per second. It is commonly agreed that these systems will be based on the model of asynchronous processors connected by a point to point network. There are a number of diierent network architectures presented in the past. In this paper we present an architectural principle that combines eeciency, realizability for very large systems, and inherent reliability needed for such large parallel processing systems. The here presented Fat Mesh of Clos network principle can be scaled in many ways to fullll the special requirements of a system design. Two realizations of this principle are presented: One is based on static switches combined to form a fully reconngurable system. This architecture has been realized for systems containing up to 320 processors. The other realization uses dynamic routing switches. By combining wormhole routing with randomized and local adaptive routing this network provides large capacity and very short latency times. The eeciency of our principle is demonstrated by simulations. Both realizations presented here are built and commercialized by Parsytec Computer.
منابع مشابه
An Eecient Vlsi-architecture for Real-time Motion Estimation
An eecient VLSI-architecture that enables hierarchical motion estimation at real time for large tracking areas in applications such as HDTV is presented here. This is capable of treating (16*16)-blocks, with a maximum displacement of +/-14 pixels, at 72 MHz pixel rate. The architecture is based on the the hierarchical three-step search algorithm with a data-ow design which allows sequential inp...
متن کاملParallel Architectures : Design and E cient Use ?
In this paper we want to demonstrate the large impact of theoretical considerations on the design and eecient use of parallel machines. We describe interconnection networks for parallel computers, tools for their eecient use (mapping, load balancing) and the parallelization of a problem which is hard to parallelize (chess programming).
متن کاملProject Triton: towards Improved Programmability of Parallel Computers Compilation Techniques. Triton/1 Parallel Architecture
This paper appeard in: The main objective of Project Triton is adequate programmability of massively parallel computers. This goal can be achieved by tightly coupling the design of programming languages and parallel hardware. The approach taken in the Project Triton is to let high-level, machine independent parallel programming languages drive the design of parallel hardware. This approach perm...
متن کاملEecient Communication Operations in Reconngurable Parallel Computers Eecient Communication Operations in Reconngurable Parallel Computers
Reconnguration is largely an unexplored property in the context of parallel models of computation. However, it is a powerful concept as far as massively parallel architectures are concerned, because it overcomes the constraints due to the bissection width arising in most of distributed memory machines. In this paper, we show how to use reconnguration in order to improve communication operations...
متن کاملA High-Speed VLSI chip for Parallel Image Decorrelation
We present here the architecture and design of a special purpose CMOS VLSI chip for high-speed parallel image decorrelation/inverse decorrelation scheme which is to be used for eecient real-time losslesss image compression/ decompression. The chip is designed for one processing element of the parallel architecture that performs the image decorrelation/inverse decorrelation. The architecture is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992